Abstract

We establish a new outer bound for the capacity region of product broadcast channels. This 
outer bound matches Marton's inner bound for a variety of classes of product broadcast channels 
whose capacity regions were previously unknown. These classes include products of reversely
semi-deterministic and products of reversely more-capable channels. A significant consequence
of this new outer bound is that it establishes, via an example, that the previously best known
outer bound is strictly suboptimal for the general broadcast channel. Our example consists
of a product broadcast channel with two semi-deterministic components in reverse orientation.
1 Introduction



The broadcast channel refers to a communication scenario where a single sender wishes to
communicate (possibly different messages) with multiple receivers. We consider a simple setting of the
problem where the sender X, who has messages $M_1, M_2$, wishes to communicate message $M_1$ to
receiver Y and $M_2$ to receiver Z over a noisy discrete memoryless broadcast channel $q(y,z|x)$. A
rate pair $(R_1, R_2)$ is said to be achievable for this broadcast channel, q, if there is a sequence
of codebooks, each consisting of:

• an encoder at the sender that maps the message pair $(m_1, m_2)$ into a sequence $X^n$,

• a decoder at receiver Y that maps the received sequence $Y^n$ into an estimate $\hat{M}_1$ of its
intended message $M_1$, and

• a decoder at receiver Z that maps the received sequence $Z^n$ into an estimate $\hat{M}_2$ of its
intended message $M_2$,

such that $P(\hat{M}_1 \neq M_1), P(\hat{M}_2 \neq M_2) \to 0$ as $n \to \infty$, when the messages $M_1, M_2$ are uniformly
distributed in $[1 : 2^{nR_1}] \times [1 : 2^{nR_2}]$. The capacity region is the closure of the set of all achievable
rate pairs. An evaluable characterization of this capacity region is a well-known open problem.

The best known inner and outer bounds for the capacity region of a general two-receiver discrete- 
memoryless broadcast channel with private messages are the following: 

• Inner bound: (Marton [Mar79]) The union of rate pairs $(R_1, R_2)$ satisfying the inequalities

$$R_1 \le I(U,W;Y)$$
$$R_2 \le I(V,W;Z)$$
$$R_1 + R_2 \le \min\{I(W;Y), I(W;Z)\} + I(U;Y|W) + I(V;Z|W) - I(U;V|W)$$

over all $(U,V,W,X)$ such that $(U,V,W) \to X \to (Y,Z)$ forms a Markov chain constitutes an inner
bound to the capacity region. Further, to compute this region it suffices [GA09] to consider
$|\mathcal{U}|, |\mathcal{V}| \le |\mathcal{X}|$ and $|\mathcal{W}| \le |\mathcal{X}| + 4$.
• Outer bound: (UV outer bound [El79], [NE07]) The union of rate pairs $(R_1, R_2)$ satisfying the
inequalities

$$R_1 \le I(U;Y)$$
$$R_2 \le I(V;Z)$$
$$R_1 + R_2 \le I(U;Y) + I(X;Z|U)$$
$$R_1 + R_2 \le I(V;Z) + I(X;Y|V)$$

over all $(U,V,X)$ such that $(U,V) \to X \to (Y,Z)$ forms a Markov chain constitutes an outer bound to
the capacity region. Further, to compute this region it suffices to consider $|\mathcal{U}|, |\mathcal{V}| \le |\mathcal{X}| + 1$.
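As a rough numerical companion to the UV outer bound, the following sketch evaluates the right-hand sides of its four inequalities for one fixed choice of $p(u,v,x)$. It is only an illustration under stated assumptions: the helper names (`entropy`, `H`, `I`, `uv_constraints`) and the toy channel with $Y = Z = X$ are hypothetical and do not come from the paper, and no maximization over $p(u,v,x)$ is attempted.

```python
import numpy as np

def entropy(p):
    """Shannon entropy in bits of a pmf given as an array of any shape."""
    p = np.asarray(p, dtype=float).ravel()
    p = p[p > 0]
    return float(-(p * np.log2(p)).sum())

def H(joint, axes):
    """Entropy of the marginal of `joint` on the listed axes."""
    drop = tuple(a for a in range(joint.ndim) if a not in axes)
    return entropy(joint.sum(axis=drop))

def I(joint, a, b, c=()):
    """Conditional mutual information I(A;B|C) for tuples of axes a, b, c."""
    a, b, c = tuple(a), tuple(b), tuple(c)
    return H(joint, a + c) + H(joint, b + c) - H(joint, a + b + c) - H(joint, c)

def uv_constraints(p_uvx, q):
    """Right-hand sides of the four UV-bound inequalities for one p(u,v,x).

    p_uvx[u, v, x] is the joint pmf of (U, V, X); q[x, y, z] is q(y,z|x).
    """
    joint = np.einsum('uvx,xyz->uvxyz', p_uvx, q)  # axes: U=0 V=1 X=2 Y=3 Z=4
    U, V, X, Y, Z = (0,), (1,), (2,), (3,), (4,)
    r1 = I(joint, U, Y)
    r2 = I(joint, V, Z)
    return {
        'R1': r1,
        'R2': r2,
        'sum_via_U': r1 + I(joint, X, Z, U),
        'sum_via_V': r2 + I(joint, X, Y, V),
    }

# Toy channel (an assumption for illustration): binary X with Y = Z = X.
q_toy = np.zeros((2, 2, 2))
for x in range(2):
    q_toy[x, x, x] = 1.0

# Auxiliaries U = V = X, with X uniform.
p_toy = np.zeros((2, 2, 2))
for x in range(2):
    p_toy[x, x, x] = 0.5

bounds = uv_constraints(p_toy, q_toy)
```

For this toy choice each individual rate is bounded by one bit, and so is the sum rate, since both receivers see the same noiseless copy of $X$.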

Remark 1. Although there have been outer bounds subsequent to this outer bound, it was shown
recently [Nai10] that all these were equivalent to the UV-outer bound.

For all the classes of channels for which the capacity region was previously established, the inner
and outer bounds coincided. In a sequence of results established in the last few years, it
has been shown [NW08], [GA09], [JN10], [GNSW10] that the inner and outer bounds can indeed be
different for various channels. Hence there are three possibilities: 1) the UV-outer bound is optimal,
2) Marton's inner bound is optimal, or 3) neither is optimal in general.

In this paper we show that the UV-outer bound is strictly sub-optimal. One of the main 
contributions of this paper is a new outer bound for product broadcast channels. This outer 
bound matches Marton's inner bound (and hence is tight) in many instances including classes 
whose capacity regions were previously unknown, e.g. product of reversely semi-deterministic and 
reversely more-capable channels. We then construct a reversely semi-deterministic channel where 
the UV-outer bound is strictly weak. 

Definition 1. A broadcast channel $q(y,z|x)$ is said to be a product broadcast channel if
$X = (X_1, X_2)$, $Y = (Y_1, Y_2)$, $Z = (Z_1, Z_2)$ and $q(y_1,y_2,z_1,z_2|x_1,x_2) = q_1(y_1,z_1|x_1)\,q_2(y_2,z_2|x_2)$.
Here we denote $q = q_1 \times q_2$.
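The product structure in Definition 1 can be made concrete with a small sketch that builds the transition array of $q_1 \times q_2$ from the two component arrays. The array layout `q[x, y, z]`, the function name `product_channel`, and the two example components are assumptions chosen for illustration only.

```python
import numpy as np

def product_channel(q1, q2):
    """Transition array of q1 x q2.

    q1[x1, y1, z1] and q2[x2, y2, z2] are conditional pmfs. The result is
    indexed by flattened pairs: x = x1*|X2| + x2, y = y1*|Y2| + y2,
    z = z1*|Z2| + z2.
    """
    a, b, c = q1.shape
    d, e, f = q2.shape
    return np.einsum('abc,def->adbecf', q1, q2).reshape(a * d, b * e, c * f)

# Example components (hypothetical): binary inputs, crossover probability 0.25.
eps = 0.25
q1 = np.zeros((2, 2, 2))  # Y1 = X1, Z1 is a noisy copy of X1
q2 = np.zeros((2, 2, 2))  # Z2 = X2, Y2 is a noisy copy of X2
for x in range(2):
    for n in range(2):
        noise = 1 - eps if n == x else eps
        q1[x, x, n] = noise
        q2[x, n, x] = noise

q = product_channel(q1, q2)
```

Each row `q[x]` of the result is again a pmf over the output pairs, which is a quick sanity check on the index bookkeeping.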

Definition 2. A product broadcast channel $q = q_1 \times q_2$ is said to be reversely semi-deterministic
if the channel to one of the receivers in the first component is deterministic, and the channel to the
other receiver in the second component is deterministic. That is, either both $q_1(y_1|x_1), q_2(z_2|x_2) \in \{0,1\}$
or both $q_1(z_1|x_1), q_2(y_2|x_2) \in \{0,1\}$.
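Definition 2 is easy to test mechanically once the component channels are stored as arrays. The sketch below is a minimal check under the assumption that each component is given as `q[x, y, z]`; the function names and the example channels are hypothetical.

```python
import numpy as np

def is_deterministic(cond):
    """True if every entry of the conditional pmf cond[x, out] is 0 or 1."""
    return bool(np.all(np.isclose(cond, 0.0) | np.isclose(cond, 1.0)))

def is_reversely_semi_deterministic(q1, q2):
    """Check Definition 2 for components q1[x1, y1, z1] and q2[x2, y2, z2]."""
    y1, z1 = q1.sum(axis=2), q1.sum(axis=1)  # q1(y1|x1), q1(z1|x1)
    y2, z2 = q2.sum(axis=2), q2.sum(axis=1)  # q2(y2|x2), q2(z2|x2)
    return ((is_deterministic(y1) and is_deterministic(z2)) or
            (is_deterministic(z1) and is_deterministic(y2)))

# Example components (hypothetical): binary inputs, crossover probability 0.25.
eps = 0.25
det_y = np.zeros((2, 2, 2))  # Y = X, Z noisy
det_z = np.zeros((2, 2, 2))  # Z = X, Y noisy
for x in range(2):
    for n in range(2):
        noise = 1 - eps if n == x else eps
        det_y[x, x, n] = noise
        det_z[x, n, x] = noise

reverse_pair = is_reversely_semi_deterministic(det_y, det_z)
same_pair = is_reversely_semi_deterministic(det_y, det_y)
```

Pairing `det_y` with `det_z` satisfies the definition, while pairing `det_y` with itself does not, because the deterministic link then goes to the same receiver in both components.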

Definition 3. A product broadcast channel $q = q_1 \times q_2$ is said to be reversely more capable if one
of the following two holds:

• $I(X_1;Y_1) \ge I(X_1;Z_1)\ \forall p(x_1)$, and $I(X_2;Z_2) \ge I(X_2;Y_2)\ \forall p(x_2)$,

• $I(X_1;Z_1) \ge I(X_1;Y_1)\ \forall p(x_1)$, and $I(X_2;Y_2) \ge I(X_2;Z_2)\ \forall p(x_2)$.

The capacity results originated from the consideration of the 2-letter version of Marton's inner
bound, and particularly of the sum-rate. Consider a broadcast channel $q(y,z|x)$. Marton's sum-rate
for this channel is given by

$$SR_{MIB}(q) = \max_{p(u,v,w,x)} \min\{I(W;Y), I(W;Z)\} + I(U;Y|W) + I(V;Z|W) - I(U;V|W). \qquad (1)$$
Consider the product channel $q \times q$ obtained by choosing two identical copies of the original channel
$q(y,z|x)$. If we could give an example where $SR_{MIB}(q \times q) > 2\,SR_{MIB}(q)$, then we could indeed
conclude that Marton's inner bound is strictly sub-optimal in general.

On the other hand, consider a class C of channels that is closed under the product operation;
i.e. if $q_1, q_2 \in C$ then $q_1 \times q_2 \in C$. If one is able to show that $SR_{MIB}(q \times q) = 2\,SR_{MIB}(q)$, $\forall q \in C$,
then one can conclude that Marton's sum rate is optimal for channels in C. This is because, on
the one hand, the normalized n-letter Marton's sum rate $\frac{1}{n} SR_{MIB}(q^{\otimes n}) \to SR^*$, the optimal
sum-rate, as n tends to infinity. Here we have used the notation $q^{\otimes n}$ to denote $\prod_{i=1}^{n} q(y_i,z_i|x_i)$,
the n-fold product channel. On the other hand, $SR_{MIB}(q \times q) = 2\,SR_{MIB}(q)$, $\forall q \in C$, shows that
$SR_{MIB}(q) = \frac{1}{2^k} SR_{MIB}(q^{\otimes 2^k})$, $\forall k \ge 0$.

See a recent related article [GGNY11] for an elaborated discussion of this 2-letter approach. This
approach is only superficially different from the usual technique of proving optimality, where one 
starts from Fano's inequality and directly identifies auxiliary random variables for a class to show 
the optimality. While these two approaches are identical, we believe that this alternate viewpoint 
has helped us get our new results. We also believe that the true utility of this approach however 
lies in the ability to test (at least via numerical simulations) whether a particular scheme is optimal 
or not. 

The outline of this paper is as follows. Our first goal is to construct an example where the
sum-rate given by the UV outer bound is loose. Next, we provide a new outer bound for the
capacity region of product channels that is optimal for the product of reversely semi-deterministic
and reversely more-capable channels. Lastly, we study some properties of the λ-sum rate that
played an important role in the construction of the example where the outer bound was loose.

2 The UV outer bound is not tight 

From the UV outer bound, the sum-rate of a general broadcast channel can be bounded from above by

$$SR_{UV}(q) = \max_{p(u,v,x)} \min\{I(U;Y) + I(V;Z),\ I(U;Y) + I(X;Z|U),\ I(V;Z) + I(X;Y|V)\}. \qquad (2)$$

In this section we exhibit a product of reversely semi-deterministic channels, $q = q_1 \times q_2$,
such that the optimal sum-rate $SR^*(q_1 \times q_2)$ satisfies

$$SR_{MIB}(q_1 \times q_2) = SR^*(q_1 \times q_2) < SR_{UV}(q_1 \times q_2).$$

This shows that the UV outer bound is strictly suboptimal for the general broadcast
channel.

Remark 2. Even if one were to consider the best outer bound with a common message requirement,
the UVW outer bound [Nai10], the fact that the UV outer bound is strictly loose in sum-rate
immediately implies the strict suboptimality of the UVW outer bound as
well. Reason: its projection on the plane $R_0 = 0$ (which becomes the UV outer bound) is strictly
suboptimal.

The flow of this section is as follows: in order to compute the sum-rate for products of reversely
semi-deterministic channels, we first define a λ-parametrized family of functions, called the λ-sum
rate, that are related to the sum rate given by Marton's inner bound. The fact that the λ-sum rate
factorizes, in the sense to be defined later, for product channels of this type is used to show that
Marton's sum-rate is optimal for this class of product channels. The last step is to show that the
UV outer bound is strictly suboptimal over this class of broadcast channels.
2.1 λ-sum rate

Define a λ-parametrized family of functions that is related to the sum rate given by Marton's inner
bound as follows:

$$\lambda\text{-}SR(q, p(u,v,w,x)) = \lambda I(W;Y) + (1-\lambda) I(W;Z) + I(U;Y|W) + I(V;Z|W) - I(U;V|W).$$

We further define

$$\lambda\text{-}SR(q, p(x)) = \max_{p(u,v,w|x)} \lambda\text{-}SR(q, p(u,v,w,x)), \quad \text{and} \quad \lambda\text{-}SR(q) = \max_{p(u,v,w,x)} \lambda\text{-}SR(q, p(u,v,w,x)).$$

Observe that $SR_{MIB}(q) = \max_{p(u,v,w,x)} \min_{\lambda \in [0,1]} \lambda\text{-}SR(q, p(u,v,w,x))$. The following claim allows us
to shift the discussion from the sum-rate to the λ-sum rate, and then return to the sum-rate at a later
point to complete the argument.
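The observation that minimizing the λ-sum rate over $\lambda \in [0,1]$ recovers the $\min\{I(W;Y), I(W;Z)\}$ term can be checked numerically for a fixed $p(u,v,w,x)$, since the expression is affine in λ. The sketch below does this on a toy example; the helper names, the array layout, and the channel ($Y = X$, $Z$ a noisy copy of $X$) are assumptions made for illustration and are not taken from the paper.

```python
import numpy as np

def entropy(p):
    """Shannon entropy in bits of a pmf given as an array of any shape."""
    p = np.asarray(p, dtype=float).ravel()
    p = p[p > 0]
    return float(-(p * np.log2(p)).sum())

def H(joint, axes):
    """Entropy of the marginal of `joint` on the listed axes."""
    drop = tuple(a for a in range(joint.ndim) if a not in axes)
    return entropy(joint.sum(axis=drop))

def I(joint, a, b, c=()):
    """Conditional mutual information I(A;B|C) for tuples of axes a, b, c."""
    a, b, c = tuple(a), tuple(b), tuple(c)
    return H(joint, a + c) + H(joint, b + c) - H(joint, a + b + c) - H(joint, c)

def lambda_sr(lam, p_uvwx, q):
    """lambda-SR(q, p(u,v,w,x)) for p_uvwx[u, v, w, x] and q[x, y, z]."""
    joint = np.einsum('uvwx,xyz->uvwxyz', p_uvwx, q)  # U=0 V=1 W=2 X=3 Y=4 Z=5
    U, V, W, Y, Z = (0,), (1,), (2,), (4,), (5,)
    return (lam * I(joint, W, Y) + (1 - lam) * I(joint, W, Z)
            + I(joint, U, Y, W) + I(joint, V, Z, W) - I(joint, U, V, W))

# Toy channel (hypothetical): Y = X, Z = X flipped with probability eps.
eps = 0.1
q_toy = np.zeros((2, 2, 2))
for x in range(2):
    for z in range(2):
        q_toy[x, x, z] = 1 - eps if z == x else eps

# W = X uniform; U and V are trivial (single-valued).
p_toy = np.zeros((1, 1, 2, 2))
for x in range(2):
    p_toy[0, 0, x, x] = 0.5

grid = np.linspace(0.0, 1.0, 101)
values = [lambda_sr(lam, p_toy, q_toy) for lam in grid]
inner_min = min(values)
```

Because the map is affine in λ, the minimum over the grid is attained at an endpoint, here at $\lambda = 0$ where the weight sits on the noisier receiver.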

Claim 1. The following min-max theorem holds:

$$\max_{p(u,v,w,x)} \min_{\lambda \in [0,1]} \lambda\text{-}SR(q, p(u,v,w,x)) = \max_{p(x)} \min_{\lambda \in [0,1]} \max_{p(u,v,w|x)} \lambda\text{-}SR(q, p(u,v,w,x))$$
$$= \min_{\lambda \in [0,1]} \max_{p(u,v,w,x)} \lambda\text{-}SR(q, p(u,v,w,x)).$$

This implies that the sum-rate of Marton's inner bound can be calculated using any of the three
above expressions.

Proof. The fact that

$$\max_{p(u,v,w,x)} \min_{\lambda \in [0,1]} \lambda\text{-}SR(q, p(u,v,w,x)) = \min_{\lambda \in [0,1]} \max_{p(u,v,w,x)} \lambda\text{-}SR(q, p(u,v,w,x))$$

was established in Section 3.1.1 of [GEA10]. For completeness we present a (slightly different) proof
in the Appendix, which can be considered as an application of a min-max theorem of Terkelsen
[Ter72]. The argument in the Appendix can also be used in other instances where a max-min
occurs, such as compound channels. □

2.2 Factorization of λ-sum rate

Given two channels $q_1(y_1,z_1|x_1)$, $q_2(y_2,z_2|x_2)$, we say that λ-SR factorizes if

$$\lambda\text{-}SR(q_1 \times q_2) = \lambda\text{-}SR(q_1) + \lambda\text{-}SR(q_2). \qquad (3)$$

We have discussed the factorization of the λ-sum rate in a recent related article [GGNY11]. We need
a shortened version of a lemma from that article:

Lemma 1. The $\lambda\text{-}SR(q_1 \times q_2)$ factorizes if any one of the four channels $X_1 \to Y_1$; $X_1 \to Z_1$; $X_2 \to Y_2$
or $X_2 \to Z_2$ is deterministic.

2.3 Marton's sum-rate is tight for product of reversely semi-deterministic channels

Claim 2. Marton's sum rate is optimal for products of reversely semi-deterministic channels.
Moreover, the sum rate of such a product channel $q_1 \times q_2$ is given by

$$\min_{\lambda \in [0,1]} \big(\lambda\text{-}SR(q_1) + \lambda\text{-}SR(q_2)\big).$$
Proof. Take two semi-deterministic channels $q_1(y_1,z_1|x_1)$ and $q_2(y_2,z_2|x_2)$ where $Y_1$ is a
deterministic function of $X_1$ and $Z_2$ is a deterministic function of $X_2$. It is well-known (also see the proof of
Lemma 1 in [GGNY11]) that the optimal sum-rate $SR^*(q_1 \times q_2)$ satisfies

$$SR^*(q_1 \times q_2) \le \liminf_{n \to \infty} \frac{1}{n} SR_{MIB}(q_1^{\otimes n} \times q_2^{\otimes n}).$$

Consider the n-letter λ-sum rate of the product channel $q_1 \times q_2$. Using Lemma 1, the λ-sum rate of the
n-letter product channel factorizes into the λ-sum rates of two n-letter sub-channels. Each of these terms again
factorizes by repeated application of Lemma 1. More precisely,

$$\lambda\text{-}SR(q_1^{\otimes n} \times q_2^{\otimes n}) = \lambda\text{-}SR(q_1^{\otimes n}) + \lambda\text{-}SR(q_2^{\otimes n}) = n \cdot \lambda\text{-}SR(q_1) + n \cdot \lambda\text{-}SR(q_2).$$

Marton's inner bound sum rate for the n-letter version of the product channel $q_1 \times q_2$ is equal to

$$\min_{\lambda \in [0,1]} \lambda\text{-}SR(q_1^{\otimes n} \times q_2^{\otimes n}).$$

We can write the above expression as

$$n \cdot \min_{\lambda \in [0,1]} \big(\lambda\text{-}SR(q_1) + \lambda\text{-}SR(q_2)\big).$$

Therefore, the actual sum rate satisfies

$$SR^*(q_1 \times q_2) \le \min_{\lambda \in [0,1]} \big(\lambda\text{-}SR(q_1) + \lambda\text{-}SR(q_2)\big).$$

On the other hand, this sum rate is achievable since it is equal to the single-letter Marton's
inner bound for $q_1 \times q_2$, i.e.

$$SR^*(q_1 \times q_2) = \min_{\lambda \in [0,1]} \lambda\text{-}SR(q_1 \times q_2) = \min_{\lambda \in [0,1]} \big(\lambda\text{-}SR(q_1) + \lambda\text{-}SR(q_2)\big).$$

□ 

2.4 The UV outer bound is strictly suboptimal 

Claim 3. Consider the reversely semi-deterministic channel in Figure 1. Assume that the transition
probabilities are uniform across the possible outputs, i.e the red edges have a probability | in the 
first component and the blue edges have a probability | in the second component. Then Marton's 
sum rate (the optimal sum rate) is given by $\frac{8}{3} = 3 - \frac{1}{3}$, while the UV sum-rate is at least $\frac{44}{15} = 3 - \frac{1}{15}$.

Proof. We begin by showing that Marton's sum rate (the optimal sum rate) is given by $\frac{8}{3}$. Claim 2
shows that the sum rate of $q_1 \times q_2$ is

$$\min_{\lambda \in [0,1]} \big(\lambda\text{-}SR(q_1) + \lambda\text{-}SR(q_2)\big).$$

The result of Appendix B.1 implies that for any $\lambda \in [0,1]$, $\lambda\text{-}SR(q_1)$ is equal to $\lambda\text{-}SR(q_1, u(x_1))$,
where u is the uniform distribution on $X_1$. A similar statement holds for $\lambda\text{-}SR(q_2)$. Therefore the
sum rate of $q_1 \times q_2$ is equal to

$$\min_{\lambda \in [0,1]} \big(\lambda\text{-}SR(q_1, u(x_1)) + \lambda\text{-}SR(q_2, u(x_2))\big). \qquad (4)$$
Figure 1: A reversely semi-deterministic channel

By symmetry, $\lambda\text{-}SR(q_2, u(x_2)) = (1-\lambda)\text{-}SR(q_1, u(x_1))$. Therefore we can express the sum rate as

$$\min_{\lambda \in [0,1]} \big(\lambda\text{-}SR(q_1, u(x_1)) + (1-\lambda)\text{-}SR(q_1, u(x_1))\big).$$

In Appendix B.2 we show that $\lambda\text{-}SR(q_1, u(x_1))$ is equal to

A-Si^u^i)) = 



|-|A AG [0,|] 
I AG [1,1] 



Substituting this function into (4), we see that the minimum occurs at $\lambda = 0.5$ and the optimum
sum-rate is equal to $\frac{8}{3}$.

To compute a lower bound on the UV sum-rate, let $p(x_1,x_2) = u(x_1)u(x_2)$, i.e. independent
uniform distributions on $X_1$ and $X_2$. We define $U_1, V_1, X_1, U_2, V_2, X_2$ having a joint distribution of
the form $p(u_1,v_1,x_1)\,p(u_2,v_2,x_2)$ as follows. Let $U_1 = Y_1$ and let $p(u_2,x_2)$ satisfy

P(A 2 = 1\U 2 = 1) = P(X 2 = 3\U 2 = 1) = L 



P(X 2 = 2\U 2 = 1) = P(X 2 = 4\U 2 = 1) 

1 
P(U 2 = 1) = P(U 2 = 2)
Similarly, let $V_2 = Z_2$ and let $p(v_1,x_1)$ satisfy

P(Xi = l|Vi = 1) = P(X 1 = 3\Vi = 1) 

P(Xi = 2\Vi = 1) = P(Xi = 4|Vi = 1) 

P(y 1 = l)=P(Vi=2) = i 
Let $Q_1$ and $Q_2$ be binary random variables that are mutually independent of each other, and of
$U_1, V_1, X_1, U_2, V_2, X_2$. Furthermore, assume that $P(Q_1 = 0) = P(Q_2 = 0) = \frac{4}{5}$. Define $V_1'$ and $U_2'$ as
follows: when $Q_1 = 0$ set $V_1' = V_1$, and else set $V_1' = X_1$. When $Q_2 = 0$ set $U_2' = U_2$, and else set
$U_2' = X_2$. Lastly, replace $V_1$ by $(V_1', Q_1)$ and $U_2$ by $(U_2', Q_2)$.

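We consider the UV region for the choice $U = (U_1, U_2', Q_2)$, $V = (V_1', Q_1, V_2)$, $X = (X_1, X_2)$. Note that

$$R_1 \le I(U_1, U_2', Q_2; Y_1, Y_2)$$
$$= I(U_1;Y_1) + I(U_2', Q_2; Y_2)$$
$$= H(Y_1) + \tfrac{4}{5} I(U_2;Y_2) + \tfrac{1}{5} I(X_2;Y_2)$$
$$= 1 + \tfrac{4}{5} \cdot \tfrac{1}{3} + \tfrac{1}{5} \cdot 1 = \tfrac{22}{15}.$$

Similarly, one can show that

$$R_2 \le I(V_1', Q_1, V_2; Z_1, Z_2) = \tfrac{22}{15}.$$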

The sum rate constraint on $R_1 + R_2$ is as follows:

$$R_1 + R_2 \le I(U_1, U_2', Q_2; Y_1, Y_2) + I(X_1, X_2; Z_1, Z_2 | U_1, U_2', Q_2)$$
$$= I(U_1;Y_1) + I(X_1;Z_1|U_1) + I(U_2', Q_2; Y_2) + I(X_2;Z_2|U_2', Q_2)$$
$$= H(Y_1) + I(X_1;Z_1|Y_1) + \tfrac{4}{5} I(U_2;Y_2) + \tfrac{1}{5} I(X_2;Y_2) + \tfrac{4}{5} H(Z_2|U_2)$$
$$= 1 + \tfrac{2}{3} + \tfrac{4}{5} \cdot \tfrac{1}{3} + \tfrac{1}{5} \cdot 1 + \tfrac{4}{5} \cdot 1 = \tfrac{44}{15}.$$

Similarly, one can show that

$$R_1 + R_2 \le I(V_1', Q_1, V_2; Z_1, Z_2) + I(X_1, X_2; Y_1, Y_2 | V_1', Q_1, V_2) = \tfrac{44}{15}.$$

Therefore the point $(R_1, R_2) = (\frac{22}{15}, \frac{22}{15})$ is in this region. Hence the UV sum rate is at least
$\frac{44}{15} = 3 - \frac{1}{15}$. Thus for the product channel under consideration

$$\tfrac{8}{3} = SR_{MIB}(q_1 \times q_2) = SR^*(q_1 \times q_2) < \tfrac{44}{15} \le SR_{UV}(q_1 \times q_2).$$

This shows that the UV outer bound is strictly suboptimal in general. □
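The arithmetic in the proof of Claim 3 can be double-checked with exact rational numbers. The sketch below simply re-adds the numerical terms that appear in the two computations above (the values $1$, $\frac{2}{3}$, $\frac{1}{3}$ and the weights $\frac{4}{5}$, $\frac{1}{5}$ are read off from the proof; the variable names are hypothetical).

```python
from fractions import Fraction as F

# Individual rate bound: H(Y1) + 4/5 * I(U2;Y2) + 1/5 * I(X2;Y2).
r1_bound = 1 + F(4, 5) * F(1, 3) + F(1, 5) * 1

# Sum-rate bound: H(Y1) + I(X1;Z1|Y1) + 4/5 * I(U2;Y2)
#                 + 1/5 * I(X2;Y2) + 4/5 * H(Z2|U2).
sum_bound = 1 + F(2, 3) + F(4, 5) * F(1, 3) + F(1, 5) * 1 + F(4, 5) * 1

marton_sum_rate = F(8, 3)
gap = sum_bound - marton_sum_rate
```

With these values the point with both rates equal to `r1_bound` meets the sum-rate constraint with equality, and `gap` is the amount by which this lower bound on the UV sum rate exceeds Marton's sum rate.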



3 Capacity regions for classes of product broadcast channels 

In this section we establish the capacity region for classes of product broadcast channels. Indeed we
consider a more general setting where, in addition to the private messages $M_1, M_2$ that the receivers
Y and Z wish to decode, the receivers also wish to decode a common message $M_0$. Hence we are
interested in the achievable rate triples $(R_0, R_1, R_2)$. The capacity region is defined in a similar
fashion as in the case without common message.
3.1 An outer bound for product channels

We present a new outer bound for the product of two broadcast channels. The manipulations
here are inspired by the manipulations that establish Lemma 1 in [GGNY11]. This outer bound
matches the capacity region for a variety of product channels, including products of two reversely
semi-deterministic and products of two reversely more-capable channels. Hence, from Claim 3 it
follows that this is a strictly better bound for product broadcast channels as compared to the UVW
outer bound.

Claim 4. Given a product channel $q(y_1 y_2, z_1 z_2 | x_1 x_2) = q_1(y_1,z_1|x_1) \times q_2(y_2,z_2|x_2)$, the union over
all $p_1(w_1,v_1,u_1,x_1)\,p_2(w_2,v_2,u_2,x_2)$ of triples $(R_0, R_1, R_2)$ satisfying

$$R_0 \le \min\{I(W_1;Y_1) + I(W_2;Y_2),\ I(W_1;Z_1) + I(W_2;Z_2)\}$$
$$R_0 + R_1 \le \min\{I(W_1;Y_1) + I(W_2;Y_2),\ I(W_1;Z_1) + I(W_2;Z_2)\} + I(U_1;Y_1|W_1) + I(U_2;Y_2|W_2)$$
$$R_0 + R_2 \le \min\{I(W_1;Y_1) + I(W_2;Y_2),\ I(W_1;Z_1) + I(W_2;Z_2)\} + I(V_1;Z_1|W_1) + I(V_2;Z_2|W_2)$$
$$R_0 + R_1 + R_2 \le \min\{I(W_1;Y_1) + I(W_2;Y_2),\ I(W_1;Z_1) + I(W_2;Z_2)\}$$
$$\qquad + I(U_2;Y_2|W_2) + I(X_2;Z_2|U_2,W_2)$$
$$\qquad + \min\{I(U_1;Y_1|W_1) + I(X_1;Z_1|U_1,W_1),\ I(V_1;Z_1|W_1) + I(X_1;Y_1|V_1,W_1)\},$$
$$R_0 + R_1 + R_2 \le \min\{I(W_1;Y_1) + I(W_2;Y_2),\ I(W_1;Z_1) + I(W_2;Z_2)\}$$
$$\qquad + \min\{I(U_2;Y_2|W_2) + I(X_2;Z_2|U_2,W_2),\ I(V_2;Z_2|W_2) + I(X_2;Y_2|V_2,W_2)\}$$
$$\qquad + I(V_1;Z_1|W_1) + I(X_1;Y_1|V_1,W_1),$$

forms an outer bound to the capacity region of the product broadcast channel.

Remark 3. One can interchange the roles of $Y_2$ and $Z_1$ with $Z_2$ and $Y_1$ respectively to get another
bound, and we can take the intersection of these two regions as an outer bound.

Proof. The proof of this outer bound is given in the Appendix. □

Remark 4. The above outer bound is strictly sub-optimal. To see this, first note that when one of
the component channels is trivial, this outer bound does not give us anything beyond the UVW outer
bound [Nai10]. Now, consider a product of three channels where the first one is trivial and the second and
third together form a reversely semi-deterministic pair. The new outer bound reduces to the UVW
bound on the reversely semi-deterministic pair, and therefore it is strictly sub-optimal. An interesting
open question would be to write an outer bound for a general broadcast channel that is at least as
good as the new outer bound for the product broadcast channel.

3.1.1 An achievable region for a product broadcast channel 

Given a product channel $q(y_1 y_2, z_1 z_2 | x_1 x_2) = q_1(y_1,z_1|x_1)\,q_2(y_2,z_2|x_2)$, the union of rate triples
satisfying

$$R_0 \le \min\{I(W_1;Y_1) + I(W_2;Y_2),\ I(W_1;Z_1) + I(W_2;Z_2)\}$$
$$R_0 + R_1 \le I(W_1;Y_1) + I(W_2;Y_2) + I(U_1;Y_1|W_1) + I(U_2;Y_2|W_2)$$
$$R_0 + R_2 \le I(W_1;Z_1) + I(W_2;Z_2) + I(V_1;Z_1|W_1) + I(V_2;Z_2|W_2) \qquad (5)$$
$$R_0 + R_1 + R_2 \le \min\{I(W_1;Y_1) + I(W_2;Y_2),\ I(W_1;Z_1) + I(W_2;Z_2)\}$$
$$\qquad + I(U_1;Y_1|W_1) + I(U_2;Y_2|W_2) + I(V_1;Z_1|W_1)$$
$$\qquad + I(V_2;Z_2|W_2) - I(U_1;V_1|W_1) - I(U_2;V_2|W_2)$$

over all $p_1(w_1,v_1,u_1,x_1)\,p_2(w_2,v_2,u_2,x_2)$ constitutes an inner bound to the capacity region. The
achievability of these points is immediate from Marton's inner bound by letting $U = (U_1,U_2)$,
$V = (V_1,V_2)$, $W = (W_1,W_2)$ with $(U,V,W,X) \sim p_1(w_1,v_1,u_1,x_1)\,p_2(w_2,v_2,u_2,x_2)$.
3.2 Capacity regions for new classes of product broadcast channels

Theorem 1. The capacity region for a product of reversely semi-deterministic (say, channels $X_1 \to Y_1$,
$X_2 \to Z_2$ are deterministic) broadcast channels is given by the union of rate triples satisfying

$$R_0 \le \min\{I(W_1;Y_1) + I(W_2;Y_2),\ I(W_1;Z_1) + I(W_2;Z_2)\}$$
$$R_0 + R_1 \le I(W_1;Y_1) + I(W_2;Y_2) + H(Y_1|W_1) + I(U_2;Y_2|W_2)$$
$$R_0 + R_2 \le I(W_1;Z_1) + I(W_2;Z_2) + I(V_1;Z_1|W_1) + H(Z_2|W_2)$$
$$R_0 + R_1 + R_2 \le \min\{I(W_1;Y_1) + I(W_2;Y_2),\ I(W_1;Z_1) + I(W_2;Z_2)\}$$
$$\qquad + I(V_1;Z_1|W_1) + H(Y_1|V_1,W_1) + I(U_2;Y_2|W_2) + H(Z_2|U_2,W_2)$$

over all $p_1(w_1,v_1,x_1)\,p_2(w_2,u_2,x_2)$.

Proof. The achievability is immediate by setting $U_1 = Y_1$ and $V_2 = Z_2$ in (5). Note that these
two choices of auxiliary random variables are possible since the channels $X_1 \to Y_1$, $X_2 \to Z_2$ are
deterministic.

The converse is also immediate from the outer bound in Claim 4. Observe that for any $p_1(w_1,v_1,u_1,x_1)$,
$p_2(w_2,v_2,u_2,x_2)$ we have

$$I(U_1;Y_1|W_1) \le H(Y_1|W_1), \qquad I(V_2;Z_2|W_2) \le H(Z_2|W_2)$$

and each of the two sum-rate terms is bounded by

$$\min\{I(W_1;Y_1) + I(W_2;Y_2),\ I(W_1;Z_1) + I(W_2;Z_2)\} + I(V_1;Z_1|W_1)$$
$$\qquad + H(Y_1|V_1,W_1) + I(U_2;Y_2|W_2) + H(Z_2|U_2,W_2).$$

Thus the outer bound is contained in the inner bound (and hence they coincide). □

Theorem 2. The capacity region for a product of reversely more-capable (say, receiver $Z_1$ is more
capable than $Y_1$, and receiver $Y_2$ is more capable than $Z_2$) broadcast channels is given by the union
of rate triples satisfying

$$R_0 \le \min\{I(W_1;Y_1) + I(W_2;Y_2),\ I(W_1;Z_1) + I(W_2;Z_2)\}$$
$$R_0 + R_1 \le \min\{I(W_1;Y_1) + I(W_2;Y_2),\ I(W_1;Z_1) + I(W_2;Z_2)\} + I(U_1;Y_1|W_1) + I(X_2;Y_2|W_2)$$
$$R_0 + R_2 \le \min\{I(W_1;Y_1) + I(W_2;Y_2),\ I(W_1;Z_1) + I(W_2;Z_2)\} + I(X_1;Z_1|W_1) + I(V_2;Z_2|W_2)$$
$$R_0 + R_1 + R_2 \le \min\{I(W_1;Y_1) + I(W_2;Y_2),\ I(W_1;Z_1) + I(W_2;Z_2)\} + I(X_2;Y_2|W_2)$$
$$\qquad + \min\{I(U_1;Y_1|W_1) + I(X_1;Z_1|U_1,W_1),\ I(X_1;Z_1|W_1)\},$$
$$R_0 + R_1 + R_2 \le \min\{I(W_1;Y_1) + I(W_2;Y_2),\ I(W_1;Z_1) + I(W_2;Z_2)\}$$
$$\qquad + \min\{I(X_2;Y_2|W_2),\ I(V_2;Z_2|W_2) + I(X_2;Y_2|V_2,W_2)\} + I(X_1;Z_1|W_1),$$

over all $p_1(w_1,u_1,x_1)\,p_2(w_2,v_2,x_2)$.

Proof. The achievability is immediate by setting $W_1' = (U_1, W_1)$, $U_1' = \emptyset$, $V_1' = X_1$ and
$W_2' = (V_2, W_2)$, $U_2' = X_2$, $V_2' = \emptyset$ in (5). Plugging these choices into (5), we obtain that one can achieve
rate triples satisfying

$$R_0 \le \min\{I(U_1,W_1;Y_1) + I(V_2,W_2;Y_2),\ I(U_1,W_1;Z_1) + I(V_2,W_2;Z_2)\}$$
$$R_0 + R_1 \le I(U_1,W_1;Y_1) + I(V_2,W_2;Y_2) + I(X_2;Y_2|V_2,W_2)$$
$$\qquad = I(W_1;Y_1) + I(W_2;Y_2) + I(U_1;Y_1|W_1) + I(X_2;Y_2|W_2)$$
$$R_0 + R_2 \le I(U_1,W_1;Z_1) + I(V_2,W_2;Z_2) + I(X_1;Z_1|U_1,W_1)$$
$$\qquad = I(W_1;Z_1) + I(W_2;Z_2) + I(X_1;Z_1|W_1) + I(V_2;Z_2|W_2)$$
$$R_0 + R_1 + R_2 \le \min\{I(U_1,W_1;Y_1) + I(V_2,W_2;Y_2),\ I(U_1,W_1;Z_1) + I(V_2,W_2;Z_2)\}$$
$$\qquad + I(X_2;Y_2|V_2,W_2) + I(X_1;Z_1|U_1,W_1).$$
The last sum-rate term can be split into two terms as follows:

$$R_0 + R_1 + R_2 \le I(U_1,W_1;Y_1) + I(V_2,W_2;Y_2) + I(X_2;Y_2|V_2,W_2) + I(X_1;Z_1|U_1,W_1)$$
$$\qquad = I(W_1;Y_1) + I(W_2;Y_2) + I(X_2;Y_2|W_2) + I(U_1;Y_1|W_1) + I(X_1;Z_1|U_1,W_1)$$
$$R_0 + R_1 + R_2 \le I(U_1,W_1;Z_1) + I(V_2,W_2;Z_2) + I(X_2;Y_2|V_2,W_2) + I(X_1;Z_1|U_1,W_1)$$
$$\qquad = I(W_1;Z_1) + I(W_2;Z_2) + I(X_1;Z_1|W_1) + I(V_2;Z_2|W_2) + I(X_2;Y_2|V_2,W_2)$$

Thus we see, by comparing term by term, that this achievable region is at least as large as the
region stated in Theorem 2, and hence the region in Theorem 2 is achievable.

The converse is also reasonably immediate from the outer bound in Claim 4. Observe the following:

$$\min\{I(W_1;Y_1) + I(W_2;Y_2),\ I(W_1;Z_1) + I(W_2;Z_2)\} + I(U_1;Y_1|W_1) + I(U_2;Y_2|W_2)$$
$$\le \min\{I(W_1;Y_1) + I(W_2;Y_2),\ I(W_1;Z_1) + I(W_2;Z_2)\} + I(U_1;Y_1|W_1) + I(X_2;Y_2|W_2),$$

$$\min\{I(W_1;Y_1) + I(W_2;Y_2),\ I(W_1;Z_1) + I(W_2;Z_2)\} + I(V_1;Z_1|W_1) + I(V_2;Z_2|W_2)$$
$$\le \min\{I(W_1;Y_1) + I(W_2;Y_2),\ I(W_1;Z_1) + I(W_2;Z_2)\} + I(X_1;Z_1|W_1) + I(V_2;Z_2|W_2),$$

$$\min\{I(W_1;Y_1) + I(W_2;Y_2),\ I(W_1;Z_1) + I(W_2;Z_2)\} + I(U_2;Y_2|W_2) + I(X_2;Z_2|U_2,W_2)$$
$$\qquad + \min\{I(U_1;Y_1|W_1) + I(X_1;Z_1|U_1,W_1),\ I(V_1;Z_1|W_1) + I(X_1;Y_1|V_1,W_1)\}$$
$$\le \min\{I(W_1;Y_1) + I(W_2;Y_2),\ I(W_1;Z_1) + I(W_2;Z_2)\} + I(X_2;Y_2|W_2)$$
$$\qquad + \min\{I(U_1;Y_1|W_1) + I(X_1;Z_1|U_1,W_1),\ I(X_1;Z_1|W_1)\},$$

$$\min\{I(W_1;Y_1) + I(W_2;Y_2),\ I(W_1;Z_1) + I(W_2;Z_2)\} + I(V_1;Z_1|W_1) + I(X_1;Y_1|V_1,W_1)$$
$$\qquad + \min\{I(U_2;Y_2|W_2) + I(X_2;Z_2|U_2,W_2),\ I(V_2;Z_2|W_2) + I(X_2;Y_2|V_2,W_2)\}$$
$$\le \min\{I(W_1;Y_1) + I(W_2;Y_2),\ I(W_1;Z_1) + I(W_2;Z_2)\} + I(X_1;Z_1|W_1)$$
$$\qquad + \min\{I(X_2;Y_2|W_2),\ I(V_2;Z_2|W_2) + I(X_2;Y_2|V_2,W_2)\}.$$

Thus we see, by comparing term by term, that the region stated in Theorem 2 is at least as large
as the outer bound in Claim 4. Hence the region in Theorem 2 is an outer bound, thus completing
the converse. □

Remark 5. The achievable region in (5) also matches the outer bound in Claim 4 for a variety of
other classes. For instance, say $Z_1$ is more capable than $Y_1$ and $Y_2$ is a deterministic function of $X_2$.
In this case, one can show that the capacity region is given by the union of rate triples satisfying

$$R_0 \le \min\{I(W_1;Y_1) + I(W_2;Y_2),\ I(W_1;Z_1) + I(W_2;Z_2)\}$$
$$R_0 + R_1 \le \min\{I(W_1;Y_1) + I(W_2;Y_2),\ I(W_1;Z_1) + I(W_2;Z_2)\} + I(U_1;Y_1|W_1) + H(Y_2|W_2)$$
$$R_0 + R_2 \le \min\{I(W_1;Y_1) + I(W_2;Y_2),\ I(W_1;Z_1) + I(W_2;Z_2)\} + I(X_1;Z_1|W_1) + I(V_2;Z_2|W_2)$$
$$R_0 + R_1 + R_2 \le \min\{I(W_1;Y_1) + I(W_2;Y_2),\ I(W_1;Z_1) + I(W_2;Z_2)\}$$
$$\qquad + I(V_2;Z_2|W_2) + H(Y_2|V_2,W_2) + I(X_1;Z_1|W_1).$$

The details are left to the reader. 
4 Properties of λ-SR(q)

In this section we discuss some of the properties of $\lambda\text{-}SR(q)$ in addition to Claim 1.

Lemma 2. $\lambda\text{-}SR(q)$ is related to the optimal sum rate as follows:

$$\min_{\lambda \in \{0,1\}} \lambda\text{-}SR(q) \ge SR^*(q),$$

i.e. the minimum value of λ-SR over $\lambda = 0$ and $\lambda = 1$ yields an upper bound on the optimal sum rate,
$SR^*(q)$.

Proof. We prove the statement for $\lambda = 0$; the proof for $\lambda = 1$ is similar. We begin by showing
that for any $p(x)$, $0\text{-}SR(q,p(x)) = \max_{p(w|x)} I(W;Z) + I(X;Y|W)$. This implies that $0\text{-}SR(q) =
\max_{p(w,x)} I(W;Z) + I(X;Y|W)$, which is in turn an upper bound on the optimal sum rate by the
UV outer bound (replace W by V). To see this, first note that by setting $V = \emptyset$, $U = X$ we obtain

$$0\text{-}SR(q,p(x)) \ge \max_{p(w|x)} I(W;Z) + I(X;Y|W).$$

To obtain the other direction, observe that

$$0\text{-}SR(q,p(x)) = \max_{p(u,v,w|x)} \{I(W;Z) + I(U;Y|W) + I(V;Z|W) - I(U;V|W)\}$$
$$= \max_{p(u,v,w|x)} \{I(VW;Z) + I(U;Y|W) - I(U;V|W)\}$$
$$= \max_{p(u,v,w|x)} \{I(VW;Z) + I(U;Y|VW) - I(U;V|WY)\}$$
$$\le \max_{p(u,v,w|x)} \{I(VW;Z) + I(X;Y|VW)\}$$
$$= \max_{p(w'|x)} I(W';Z) + I(X;Y|W').$$

Note that in the last step we replace $(V,W)$ by $W'$.
Thus we have, as desired,

$$0\text{-}SR(q,p(x)) = \max_{p(w|x)} I(W;Z) + I(X;Y|W).$$

□



Corollary 1. If the minimum value of $\lambda\text{-}SR(q)$ is attained at $\lambda = 0$ or $\lambda = 1$ then $SR_{MIB}(q) = SR^*(q)$,
i.e. Marton's strategy achieves the optimal sum-rate.

Proof. This follows from the relationships

$$\min_{\lambda \in [0,1]} \lambda\text{-}SR(q) = SR_{MIB}(q) \le SR^*(q) \le \min_{\lambda \in \{0,1\}} \lambda\text{-}SR(q).$$

□

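Lemma 3. For any given q and $p(x)$, $\lambda\text{-}SR(q,p(x))$ and $\lambda\text{-}SR(q)$ are convex in λ for $\lambda \in [0,1]$.

Proof. To show that $\lambda \mapsto \lambda\text{-}SR(q,p(x))$ is convex, take arbitrary $\lambda_1, \lambda_2, \lambda_3$ satisfying $\lambda_2 = \frac{\lambda_1 + \lambda_3}{2}$.
Take some $p(w^*,u^*,v^*|x)$ maximizing $\lambda_2\text{-}SR(q,p(x))$. Note that

$$\lambda_2\text{-}SR(q,p(x)) = \{\lambda_2 I(W^*;Y) + (1-\lambda_2) I(W^*;Z) + I(U^*;Y|W^*) + I(V^*;Z|W^*) - I(U^*;V^*|W^*)\}$$
$$= \tfrac{1}{2}\{\lambda_1 I(W^*;Y) + (1-\lambda_1) I(W^*;Z) + I(U^*;Y|W^*) + I(V^*;Z|W^*) - I(U^*;V^*|W^*)\}$$
$$\quad + \tfrac{1}{2}\{\lambda_3 I(W^*;Y) + (1-\lambda_3) I(W^*;Z) + I(U^*;Y|W^*) + I(V^*;Z|W^*) - I(U^*;V^*|W^*)\}$$
$$\le \tfrac{1}{2}\big[\lambda_1\text{-}SR(q,p(x)) + \lambda_3\text{-}SR(q,p(x))\big].$$

To show that $\lambda \mapsto \lambda\text{-}SR(q)$ is convex, note that

$$\lambda_2\text{-}SR(q) = \max_{p(x)} \lambda_2\text{-}SR(q,p(x)) \le \max_{p(x)} \tfrac{1}{2}\big[\lambda_1\text{-}SR(q,p(x)) + \lambda_3\text{-}SR(q,p(x))\big]$$
$$\le \max_{p(x)} \tfrac{1}{2}\lambda_1\text{-}SR(q,p(x)) + \max_{p(x)} \tfrac{1}{2}\lambda_3\text{-}SR(q,p(x)) = \tfrac{1}{2}\big[\lambda_1\text{-}SR(q) + \lambda_3\text{-}SR(q)\big].$$

□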
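The convexity argument rests on a simple fact: for each fixed choice of auxiliaries the λ-sum rate is affine in λ, and a pointwise maximum of affine functions is convex. The sketch below illustrates this fact numerically with randomly drawn slopes and intercepts standing in for $I(W;Y) - I(W;Z)$ and the remaining terms; these random numbers are purely hypothetical and are not computed from any channel.

```python
import numpy as np

def upper_envelope(lam, slopes, intercepts):
    """Pointwise maximum over a finite family of affine functions of lam."""
    return float(np.max(intercepts + lam * slopes))

rng = np.random.default_rng(0)
slopes = rng.uniform(-1.0, 1.0, size=50)     # stand-ins for I(W;Y) - I(W;Z)
intercepts = rng.uniform(0.0, 2.0, size=50)  # stand-ins for the lam = 0 value

grid = np.linspace(0.0, 1.0, 201)
f = np.array([upper_envelope(lam, slopes, intercepts) for lam in grid])

# Midpoint convexity on the grid: f(mid) <= (f(left) + f(right)) / 2.
midpoint_gap = 0.5 * (f[:-2] + f[2:]) - f[1:-1]
```

Every entry of `midpoint_gap` is nonnegative up to rounding, which is the discrete analogue of the inequality derived in the proof with $\lambda_2$ the midpoint of $\lambda_1$ and $\lambda_3$.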



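Lemma 4. To compute $\lambda\text{-}SR(q)$, it suffices to consider auxiliary random variables that satisfy
$|\mathcal{U}| \le \min(|\mathcal{X}|, |\mathcal{Y}|)$, $|\mathcal{V}| \le \min(|\mathcal{X}|, |\mathcal{Z}|)$, $|\mathcal{W}| \le |\mathcal{X}|$.

Proof. This is proved in Theorem 2 of [GEA10]. □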

Lemma 5. Take some arbitrary $p(x)$ and real $\lambda^*$. Then for any $p(w^*,u^*,v^*|x)$ maximizing
$\lambda^*\text{-}SR(q,p(x))$, the line $\lambda \mapsto (\lambda - \lambda^*)(I(W^*;Y) - I(W^*;Z)) + \lambda^*\text{-}SR(q,p(x))$ is a supporting
hyperplane to the convex curve $\lambda \mapsto \lambda\text{-}SR(q,p(x))$.

Proof. At $\lambda = \lambda^*$, the expression $(\lambda - \lambda^*)(I(W^*;Y) - I(W^*;Z)) + \lambda^*\text{-}SR(q,p(x))$ is equal to
$\lambda^*\text{-}SR(q,p(x))$, which is a point on the curve $\lambda \mapsto \lambda\text{-}SR(q,p(x))$. We need to show that for any
arbitrary λ,

$$\lambda\text{-}SR(q,p(x)) \ge (\lambda - \lambda^*)(I(W^*;Y) - I(W^*;Z)) + \lambda^*\text{-}SR(q,p(x)).$$

The above inequality holds because it is equivalent to

$$\lambda\text{-}SR(q,p(x)) \ge \lambda I(W^*;Y) + (1-\lambda) I(W^*;Z) + I(U^*;Y|W^*) + I(V^*;Z|W^*) - I(U^*;V^*|W^*).$$

□

Lemma 6. $\lambda\text{-}SR(q,p(x))$ is constant in λ for less noisy channels and deterministic channels, and
linear in λ for more capable channels.

Proof. Less noisy: Assume that Y is less noisy than Z.

$$\lambda\text{-}SR(q,p(x)) = \max_{p(u,v,w|x)} [\lambda I(W;Y) + (1-\lambda) I(W;Z) + I(U;Y|W) + I(V;Z|W) - I(U;V|W)]$$
$$\le \max_{p(u,v,w|x)} [\lambda I(W;Y) + (1-\lambda) I(W;Z) + I(U;Y|W) + I(V;Z|U,W)]$$
$$\le \max_{p(u,v,w|x)} [\lambda I(W;Y) + (1-\lambda) I(W;Z) + I(U;Y|W) + I(V;Y|U,W)]$$
$$\le \max_{p(w|x)} [\lambda I(W;Y) + (1-\lambda) I(W;Z) + I(X;Y|W)]$$
$$\le \max_{p(w|x)} [I(W;Y) + I(X;Y|W)] = I(X;Y).$$
On the other hand, setting $W = \emptyset$, $U = X$, $V = \emptyset$ shows that $\lambda\text{-}SR(q,p(x)) \ge I(X;Y)$. Hence
$\lambda\text{-}SR(q,p(x))$ is a constant.

Remark 6. The capacity region of the product of reversely degraded broadcast channels was already
known; in this case the UV-outer bound is tight since the λ-SR is a straight line.

Deterministic:

$$\lambda\text{-}SR(q,p(x)) = \max_{p(u,v,w|x)} [\lambda I(W;Y) + (1-\lambda) I(W;Z) + I(U;Y|W) + I(V;Z|W) - I(U;V|W)]$$
$$\le \max_{p(u,v,w|x)} [\lambda I(W;Y) + (1-\lambda) I(W;Z) + I(U;Y|W) + I(V;Z|U,W)]$$
$$\le \max_{p(u,v,w|x)} [\lambda I(W;Y,Z) + (1-\lambda) I(W;Y,Z) + I(U;Y,Z|W) + I(V;Y,Z|U,W)]$$
$$\le I(X;Y,Z) = H(Y,Z).$$

On the other hand, setting $W = \emptyset$, $U = Y$, $V = Z$ shows that $\lambda\text{-}SR(q,p(x)) \ge H(Y,Z)$. Hence
$\lambda\text{-}SR(q,p(x))$ is a constant. (Note that these choices of auxiliaries, i.e. $U = Y$, $V = Z$, are
permissible for deterministic channels since $(U,V) \to X \to (Y,Z)$ is a Markov chain.)
More capable: Assume that Y is more capable than Z.

$$\lambda\text{-}SR(q,p(x)) = \max_{p(u,v,w|x)} [\lambda I(W;Y) + (1-\lambda) I(W;Z) + I(U;Y|W) + I(V;Z|W) - I(U;V|W)]$$
$$\le \max_{p(u,v,w|x)} [\lambda I(W;Y) + (1-\lambda) I(W;Z) + I(U;Y|W) + I(V;Z|U,W)]$$
$$\le \max_{p(u,w|x)} [\lambda I(W;Y) + (1-\lambda) I(W;Z) + I(U;Y|W) + I(X;Z|U,W)]$$
$$\le \max_{p(u,w|x)} [\lambda I(W;Y) + (1-\lambda) I(W;Z) + I(U;Y|W) + I(X;Y|U,W)]$$
$$\le \max_{p(w|x)} [\lambda I(W;Y) + (1-\lambda) I(W;Z) + I(X;Y|W)]$$
$$= I(X;Y) + (1-\lambda) \max_{p(w|x)} (I(W;Z) - I(W;Y)).$$

On the other hand, setting $U = X$, $V = $ constant shows that $\lambda\text{-}SR(q,p(x)) \ge I(X;Y) +
(1-\lambda) \max_{p(w|x)} (I(W;Z) - I(W;Y))$. Hence $\lambda\text{-}SR(q,p(x))$ is linear in λ. □
A A Min-Max Theorem

Theorem 3 (Theorem 3 of [Ter72]). Let X be a compact connected space, let Y be a set, and let
$f : X \times Y \to \mathbb{R}$ be a function satisfying:

(i) For any $y_1, y_2 \in Y$ there exists $y_0 \in Y$ such that

$$f(x, y_0) \ge \tfrac{1}{2}\big(f(x,y_1) + f(x,y_2)\big), \quad \forall x \in X.$$

(ii) Every finite intersection of sets of the form $\{x \in X : f(x,y) \le \alpha\}$ with $(y,\alpha) \in Y \times \mathbb{R}$ is
closed and connected.

Then

$$\sup_{y \in Y} \min_{x \in X} f(x,y) = \min_{x \in X} \sup_{y \in Y} f(x,y).$$

We now present a corollary of the above theorem that can potentially be used in many
information theory scenarios.

Corollary 2. Let $\Lambda_d$ be the d-dimensional simplex, i.e. $\lambda_i \ge 0$ and $\sum_{i=1}^{d} \lambda_i = 1$. Let $\mathcal{P}$ be a set
of probability distributions $p(u)$. Let $T_i(p(u))$, $i = 1, \ldots, d$ be a set of functions such that the set A,
defined by

$$A = \{(a_1, a_2, \ldots, a_d) \in \mathbb{R}^d : a_i \le T_i(p(u)),\ i = 1, \ldots, d, \text{ for some } p(u) \in \mathcal{P}\},$$

is a convex set.
Then

$$\sup_{p(u) \in \mathcal{P}} \min_{\lambda \in \Lambda_d} \sum_{i=1}^{d} \lambda_i T_i(p(u)) = \min_{\lambda \in \Lambda_d} \sup_{p(u) \in \mathcal{P}} \sum_{i=1}^{d} \lambda_i T_i(p(u)).$$
Proof. Let $f(\lambda,p(u)) = \sum_{i=1}^{d} \lambda_i T_i(p(u))$. It suffices to verify that $f(\lambda,p(u))$ satisfies the conditions of
Theorem 3. Since the set $\mathcal{A}$ is convex, we know that for any $p_1(u), p_2(u) \in \mathcal{P}$ we have a distribution
$p_c(u) \in \mathcal{P}$ such that

\[ T_i(p_c(u)) \ge \tfrac{1}{2}\big(T_i(p_1(u)) + T_i(p_2(u))\big), \quad i = 1, \ldots, d. \]

Hence (using linearity in $\lambda$ and non-negativity of $\lambda_i$) we have

\[ f(\lambda,p_c(u)) \ge \tfrac{1}{2}\big(f(\lambda,p_1(u)) + f(\lambda,p_2(u))\big), \quad \forall \lambda \in \Delta_d. \]

Since $f(\lambda,p(u))$ is a linear function of $\lambda$, it is immediate that the set

\[ B(p(u),\alpha) = \{\lambda \in \Delta_d : f(\lambda,p(u)) \le \alpha\} \]

is closed for every pair $(p(u),\alpha) \in \mathcal{P} \times \mathbb{R}$. Further, due to the linearity in $\lambda$, if $\lambda_1, \lambda_2 \in B(p(u),\alpha)$,
then the line segment joining $\lambda_1$ and $\lambda_2$ belongs to $B(p(u),\alpha)$. This implies that a finite intersection
of such sets, each containing $\lambda_1$ and $\lambda_2$, will also contain the line segment joining $\lambda_1$ and $\lambda_2$, showing
that the finite intersection is connected. Therefore finite intersections of sets of the form
$B(p(u),\alpha)$ are closed and connected. Thus Corollary 2 follows from Theorem 3. □
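As an illustration of the min-max exchange, consider a toy instance with $d = 2$ that is not taken from the paper: distributions are replaced by a single parameter $t \in [0,1]$ with $T_1 = \sqrt{t}$ and $T_2 = \sqrt{1-t}$, so that the set of dominated points is the down-closure of a quarter circle and hence convex. The sketch below compares the two sides on a grid; the functions and the grid size are assumptions made for the example.

```python
import math


def weighted_sum(lam, t):
    """f(lambda, t) = lambda * T1(t) + (1 - lambda) * T2(t) for the toy pair
    T1(t) = sqrt(t), T2(t) = sqrt(1 - t)."""
    return lam * math.sqrt(t) + (1.0 - lam) * math.sqrt(1.0 - t)


def sup_min_and_min_sup(steps=200):
    """Grid approximation of sup_t min_lambda f and min_lambda sup_t f."""
    grid = [k / steps for k in range(steps + 1)]
    sup_min = max(min(weighted_sum(lam, t) for lam in grid) for t in grid)
    min_sup = min(max(weighted_sum(lam, t) for t in grid) for lam in grid)
    return sup_min, min_sup
```

On this instance both quantities equal $\sqrt{1/2}$, attained at $t = 1/2$ and $\lambda = 1/2$. If the convexity assumption is dropped (for example by restricting $t$ to $\{0, 1\}$), the first quantity falls to $0$ while the second is $1/2$, so the two sides need not agree.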

We will now show how one can use Corollary 2 to establish Claim 1.

Proof. (Proof of Claim 1) It is clear that

\begin{align*}
\max_{p(u,v,w,x)} \min_{\lambda \in [0,1]} \lambda\text{-}SR(q,p(u,v,w,x)) &\le \max_{p(x)} \min_{\lambda \in [0,1]} \max_{p(u,v,w|x)} \lambda\text{-}SR(q,p(u,v,w,x)) \\
&\le \min_{\lambda \in [0,1]} \max_{p(u,v,w,x)} \lambda\text{-}SR(q,p(u,v,w,x)).
\end{align*}

Therefore it suffices to show that

\[ \max_{p(u,v,w,x)} \min_{\lambda \in [0,1]} \lambda\text{-}SR(q,p(u,v,w,x)) = \min_{\lambda \in [0,1]} \max_{p(u,v,w,x)} \lambda\text{-}SR(q,p(u,v,w,x)). \]

Here we take $d = 2$ and set

\begin{align*}
T_1(p(u,v,w,x)) &= I(W;Y) + I(U;Y|W) + I(V;Z|W) - I(U;V|W), \\
T_2(p(u,v,w,x)) &= I(W;Z) + I(U;Y|W) + I(V;Z|W) - I(U;V|W).
\end{align*}

It is clear that the set

\[ \mathcal{A} = \{(a_1,a_2) : a_1 \le T_1(p(u,v,w,x)),\ a_2 \le T_2(p(u,v,w,x))\} \]

is a convex set. (In the standard manner, choose $\tilde{W} = (W,Q)$, and when $Q = 0$ choose $(U,V,W,X) \sim p_1(u,v,w,x)$
and when $Q = 1$ choose $(U,V,W,X) \sim p_2(u,v,w,x)$.) Hence from Corollary 2 we have the
proof of Claim 1. □

Remark 7. The proof of this claim in section 3.1.1 of [GEA10] is very similar in flavor and uses the
convexity of the set $\mathcal{A}$. However, here we recover it from an application of some general theorems,
and this technique and Corollary 2 may be helpful in other situations as well.
B The F function for the semi-deterministic channel of Figure 1

B.1 Maximum of F is obtained at the uniform input distribution

Consider the semi-deterministic channel of Figure 1. In this appendix we show that for any $\lambda \in [0,1]$,
$\lambda\text{-}SR(q,p(x))$ is less than or equal to $\lambda\text{-}SR(q,u(x))$, where $u$ is the uniform distribution on
$\mathcal{X}$.

Note that $\lambda\text{-}SR(q,p(x))$ is concave in $p(x)$. To see this, take two marginal distributions $p_0(x)$
and $p_1(x)$, and assume that $(U_0,V_0,W_0,X_0)$ and $(U_1,V_1,W_1,X_1)$ are two sets of random variables
maximizing the expressions of $\lambda\text{-}SR(q,p_0(x))$ and $\lambda\text{-}SR(q,p_1(x))$ respectively. Take a uniform
binary random variable $Q$, independent of all previously defined random variables. Let $U = U_Q$,
$V = V_Q$, $W = (W_Q,Q)$, $X = X_Q$. Observe that $X$ is distributed according to $\frac{p_0(x)+p_1(x)}{2}$,
and furthermore if we compute the expression in $\lambda\text{-}SR(q,p(x))$ for $(U,V,W,X)$, we get a value
that is greater than or equal to the average of the corresponding values for $(U_0,V_0,W_0,X_0)$ and
$(U_1,V_1,W_1,X_1)$. Therefore

\[ \tfrac{1}{2}\,\lambda\text{-}SR(q,p_0(x)) + \tfrac{1}{2}\,\lambda\text{-}SR(q,p_1(x)) \le \lambda\text{-}SR\Big(q, \frac{p_0(x)+p_1(x)}{2}\Big). \]

Thus, $\lambda\text{-}SR(q,p(x))$ is concave in $p(x)$.

Take an arbitrary $p(x)$ of the form $P(X) = (a,b,c,d)$. Because of the symmetries of the channel
of Figure 1 with respect to the two receivers, we have

\begin{align*}
\lambda\text{-}SR(q,p(x) \sim (a,b,c,d)) &= \lambda\text{-}SR(q,p(x) \sim (b,a,d,c)) \\
&= \lambda\text{-}SR(q,p(x) \sim (c,d,a,b)) \\
&= \lambda\text{-}SR(q,p(x) \sim (d,c,b,a)).
\end{align*}

Here we have used the symmetry between inputs 1 and 2, the symmetry between inputs 3 and
4, and the symmetry between the pairs of inputs (1, 2) and (3, 4). Using the concavity of F, we have

\begin{align*}
4\,\lambda\text{-}SR(q,p(x) \sim (a,b,c,d)) &= \lambda\text{-}SR(q,p(x) \sim (a,b,c,d)) + \lambda\text{-}SR(q,p(x) \sim (b,a,d,c)) \\
&\quad + \lambda\text{-}SR(q,p(x) \sim (c,d,a,b)) + \lambda\text{-}SR(q,p(x) \sim (d,c,b,a)) \\
&\le 4\,\lambda\text{-}SR\Big(q,p(x) \sim \big(\tfrac{1}{4}(a+b+c+d), \tfrac{1}{4}(a+b+c+d), \tfrac{1}{4}(a+b+c+d), \tfrac{1}{4}(a+b+c+d)\big)\Big) \\
&= 4\,\lambda\text{-}SR(q,u(x)).
\end{align*}
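The symmetrization step can be sketched numerically. The code below averages the four permuted input distributions and confirms that the average is uniform; it then evaluates a stand-in function on both. The stand-in is the Shannon entropy of the input, chosen only because it is concave and invariant under the same permutations; it is an assumption for illustration and is not the $\lambda$-sum rate of the channel in Figure 1.

```python
import math


def entropy(p):
    """Shannon entropy in bits; used here only as a stand-in concave, symmetric function."""
    return -sum(x * math.log2(x) for x in p if x > 0)


def symmetrize(p):
    """Return the four permuted copies of p = (a, b, c, d) and their average."""
    a, b, c, d = p
    orbit = [(a, b, c, d), (b, a, d, c), (c, d, a, b), (d, c, b, a)]
    average = tuple(sum(q[k] for q in orbit) / 4 for k in range(4))
    return orbit, average


def stand_in_gap(p):
    """Value at the averaged input minus the average of the values on the orbit."""
    orbit, average = symmetrize(p)
    return entropy(average) - sum(entropy(q) for q in orbit) / 4
```

Each coordinate of the average collects $a$, $b$, $c$ and $d$ exactly once, so the average is $(\tfrac14, \tfrac14, \tfrac14, \tfrac14)$ whenever $a+b+c+d = 1$, and concavity makes the returned gap non-negative.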



B.2 Computing the $\lambda$-sum rate at the uniform input distribution

In this appendix we compute $\lambda\text{-}SR(q,u(x))$ at the uniform input distribution for the semi-deterministic
channel given in Figure 1.

Claim 5. The $\lambda \mapsto \lambda\text{-}SR(q,u(x))$ curve for the channel under consideration consists of two lines,

X-SR(q,u(x)) = 



2 A Ae[0,i] 



3 3" " ^- L ' 2 
4 
3 



AG [1,1] 
Proof. Note that

\begin{align*}
\lambda\text{-}SR(q,u(x)) &= \max_{p(u,v,w|x)} \{\lambda I(W;Y) + (1-\lambda) I(W;Z) + I(U;Y|W) + I(V;Z|W) - I(U;V|W)\} \\
&= \max_{p(u,w|x)} \{\lambda I(W;Y) + (1-\lambda) I(W;Z) + I(U;Y|W) + H(Z|UW)\}.
\end{align*}

In the last step we have used the inequality $I(V;Z|W) - I(U;V|W) \le H(Z|UW)$ together with
the fact that $I(Z;Z|W) - I(U;Z|W) = H(Z|UW)$. Therefore $\lambda\text{-}SR(q,u(x))$ can be written as

\[ \max_{p(u,w|x)} \{\lambda H(Y) + (1-\lambda) H(Z) + (1-\lambda)(H(Y|W) - H(Z|W)) + H(Z|UW) - H(Y|UW)\}, \]

which is equal to

\[ \lambda H(Y) + (1-\lambda) H(Z) + \max_{p(w|x)} \Big\{(1-\lambda)\big(H(Y|W) - H(Z|W)\big) + \max_{p(u|w,x)} \big(H(Z|UW) - H(Y|UW)\big)\Big\}. \quad (6) \]
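The rewriting that leads to (6) is an entropy identity that holds for every joint distribution of $(W, U, Y, Z)$. A minimal numerical sketch follows; the alphabet sizes and the random joint distribution are illustrative assumptions and do not model the channel of Figure 1.

```python
import itertools
import math
import random

W, U, Y, Z = 0, 1, 2, 3  # coordinate positions inside each outcome tuple


def random_joint(sizes, rng):
    """A random joint pmf over the product alphabet with the given sizes."""
    outcomes = list(itertools.product(*[range(s) for s in sizes]))
    weights = [rng.random() for _ in outcomes]
    total = sum(weights)
    return {o: w / total for o, w in zip(outcomes, weights)}


def entropy(joint, *coords):
    """Joint entropy in bits of the listed coordinates."""
    marginal = {}
    for outcome, p in joint.items():
        key = tuple(outcome[c] for c in coords)
        marginal[key] = marginal.get(key, 0.0) + p
    return -sum(p * math.log2(p) for p in marginal.values() if p > 0)


def two_forms(lam, seed=0):
    """Return the objective before and after it is rewritten with entropies."""
    joint = random_joint((2, 3, 3, 2), random.Random(seed))
    h = lambda *c: entropy(joint, *c)
    i_wy = h(W) + h(Y) - h(W, Y)
    i_wz = h(W) + h(Z) - h(W, Z)
    i_uy_w = h(U, W) + h(Y, W) - h(U, Y, W) - h(W)
    h_z_uw = h(Z, U, W) - h(U, W)
    h_y_uw = h(Y, U, W) - h(U, W)
    h_y_w = h(Y, W) - h(W)
    h_z_w = h(Z, W) - h(W)
    before = lam * i_wy + (1 - lam) * i_wz + i_uy_w + h_z_uw
    after = (lam * h(Y) + (1 - lam) * h(Z)
             + (1 - lam) * (h_y_w - h_z_w) + h_z_uw - h_y_uw)
    return before, after
```

Because $H(Y)$ and $H(Z)$ are fixed by the input distribution, they can be pulled outside the maximization, which is how the nested form in (6) arises.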

Let $P(X|W = i) = (a_i,b_i,c_i,d_i)$, and $f(a_i,b_i,c_i,d_i) = \max_{p(u|x)} H(Z|U) - H(Y|U)$ conditioned
on $P(X) = (a_i,b_i,c_i,d_i)$. Observe that $f$ is concave. The argument is similar to the one given
above in the first part of this appendix and we will not repeat it here. Further, observe that
$f(a_i,b_i,c_i,d_i) = f(b_i,a_i,d_i,c_i)$ because of the symmetry between inputs 1 and 2, and the symmetry
between inputs 3 and 4.

Consider the transformation $(a_i,b_i,c_i,d_i) \to (b_i,a_i,d_i,c_i)$, for all $i$ while leaving $P(W = i)$
unchanged. This preserves the expression in equation (6) because of the symmetry between inputs
1 and 2, and the symmetry between inputs 3 and 4. Thus the transformation $(a_i,b_i,c_i,d_i) \to
(\frac{a_i+b_i}{2}, \frac{a_i+b_i}{2}, \frac{c_i+d_i}{2}, \frac{c_i+d_i}{2})$, for all $i$ while leaving $P(W = i)$ unchanged, does not decrease the $\lambda$-sum
rate since $H(Y|W)$ and $f$ are concave functions in $(a_i,b_i,c_i,d_i)$, and $H(Z|W)$, which appears with
a negative sign, remains constant under this transformation. Therefore without loss of generality
assume that $P(X|W = i) = (\frac{x_i}{2}, \frac{x_i}{2}, \frac{1-x_i}{2}, \frac{1-x_i}{2})$ when optimizing the expression in equation (6). Let
$P(W = i) = w_i$. Then we require $\sum_i w_i x_i = \frac{1}{2}$.

Hence we can work out the expression as the maximum over $w_i, x_i$ satisfying the above constraint
of

Alog6 + (l-A) + (l-A)^ W aiog3 + ^-^(x l ,l-xO] + ^^/(f,|,^,^). 

i i 

We now compute /(^, ht 1 )- Observe that 

HiZ) - H(Y) = H{a + b,c + d) - H (2±* , £ + £, 2+* + * £ + !?) 

§ H(a + b ,c + d )-H('-^, 2±|±* 2. *±§±* », £ + 4) 

W rr , . ,s m a + & a + 6 + c + d a + 6 c + d c + d. 
<H(a + b,c + d)-H(—, ,_ ) 

= ]-H(a + b,c + d) - log 3. 

The step (a) holds because the expression is convex in c and d once we fix c + d, therefore its 
maximum must occur at the boundaries. The step (b) holds because the expression is convex in a 
and b once we fix a + b, therefore its maximum must occur at the boundaries. 
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Therefore H(Z)-H(Y) < \H{a+b, c+d)-\og 3 for all permissible (a, b, c, d). Since the function 
\H{a + b,c + d) — log 3 is concave, we conclude that /(a,, 6j, Cj, di) < \H(a + b,c + d) — log 3 for all 
permissible (a, b, c, d). Hence, at (a, b, c, d) = (5f , ?f , ^p 1 , we have 

/(y,y,— ^— »— 2~- ) - 3^(^,1 -^) - log 3. 

The equality can be indeed achieved by taking with probability half (0,Xj,0, 1 — x«) and with 
probability half (x;,0, 1 - Xj,0). Thus, f-, = \H(xi, I - Xi) - log 3. 

Substituting this in we get 

1 + (1 - \)l + (1 - |(1 - A)) J] ^if(x i5 1 - xO. 

i 

We need to maximize this subject to ^WiXi = ^. Clearly when (1 — A) < 5 the optimal choice is 
to set Xj = 2- hr the other interval, it is optimal to set x% = w.p. \ and x, = 1 w.p. \. In this 
case we get 1 + (1 — A)|. □ 

C Proof of the outer bound (Claim 4)

We wish to show that the union over all $p_1(w_1,v_1,u_1,x_1)\,p_2(w_2,v_2,u_2,x_2)$ of triples $(R_0,R_1,R_2)$
satisfying

\begin{align*}
R_0 &\le \min\{I(W_1;Y_1) + I(W_2;Y_2),\ I(W_1;Z_1) + I(W_2;Z_2)\} \\
R_0 + R_1 &\le \min\{I(W_1;Y_1) + I(W_2;Y_2),\ I(W_1;Z_1) + I(W_2;Z_2)\} + I(U_1;Y_1|W_1) + I(U_2;Y_2|W_2) \\
R_0 + R_2 &\le \min\{I(W_1;Y_1) + I(W_2;Y_2),\ I(W_1;Z_1) + I(W_2;Z_2)\} + I(V_1;Z_1|W_1) + I(V_2;Z_2|W_2) \\
R_0 + R_1 + R_2 &\le \min\{I(W_1;Y_1) + I(W_2;Y_2),\ I(W_1;Z_1) + I(W_2;Z_2)\} \\
&\quad + I(U_2;Y_2|W_2) + I(X_2;Z_2|U_2,W_2) \\
&\quad + \min\{I(U_1;Y_1|W_1) + I(X_1;Z_1|U_1,W_1),\ I(V_1;Z_1|W_1) + I(X_1;Y_1|V_1,W_1)\}, \\
R_0 + R_1 + R_2 &\le \min\{I(W_1;Y_1) + I(W_2;Y_2),\ I(W_1;Z_1) + I(W_2;Z_2)\} \\
&\quad + \min\{I(U_2;Y_2|W_2) + I(X_2;Z_2|U_2,W_2),\ I(V_2;Z_2|W_2) + I(X_2;Y_2|V_2,W_2)\} \\
&\quad + I(V_1;Z_1|W_1) + I(X_1;Y_1|V_1,W_1),
\end{align*}

forms an outer bound to the capacity region of a product broadcast channel.

Proof. Take a code of length $n$. Let $Q$ be a random variable independent of the code book such
that $Q$ is uniform in $[1:n]$. Identify

\begin{align*}
W_1 &= (M_0, Z_{21}^n, Y_{11}^{Q-1}, Z_{1,Q+1}^n, Q), \quad W_2 = (M_0, Y_{11}^n, Y_{21}^{Q-1}, Z_{2,Q+1}^n, Q), \quad U_1 = U_2 = M_1, \\
V_1 &= V_2 = M_2, \quad X_1 = X_{1Q}, \quad X_2 = X_{2Q}.
\end{align*}

We need to verify that this choice of auxiliaries works. We begin with the sum rate. The
manipulations on the sum-rate are the most unconventional, while the rest are quite standard.
Using the Fano inequality, for any $\lambda \in [0,1]$, we can write

\begin{align*}
& n(R_0 + R_1 + R_2) - n\epsilon_{1n} \\
&\le \lambda I(M_0;Y_{11}^n,Y_{21}^n) + (1-\lambda) I(M_0;Z_{11}^n,Z_{21}^n) + I(M_1;Y_{11}^n,Y_{21}^n|M_0) + I(M_2;Z_{11}^n,Z_{21}^n|M_0) - I(M_1;M_2|M_0) \\
&= \lambda I(M_0;Y_{21}^n|Y_{11}^n) + (1-\lambda) I(M_0;Z_{21}^n) + I(M_1;Y_{21}^n|M_0,Y_{11}^n) + I(M_2;Z_{21}^n|M_1,M_0) \\
&\quad + \lambda I(M_0;Y_{11}^n) + (1-\lambda) I(M_0;Z_{11}^n|Z_{21}^n) + I(M_1;Y_{11}^n|M_0) + I(M_2;Z_{11}^n|Z_{21}^n,M_0) - I(M_1;M_2|M_0,Z_{21}^n) \\
&\le \lambda I(M_0;Y_{21}^n|Y_{11}^n) + (1-\lambda) I(M_0,Y_{11}^n;Z_{21}^n) + I(M_1;Y_{21}^n|M_0,Y_{11}^n) + I(M_2;Z_{21}^n|M_1,M_0,Y_{11}^n) \\
&\quad + \lambda I(M_0,Z_{21}^n;Y_{11}^n) + (1-\lambda) I(M_0;Z_{11}^n|Z_{21}^n) + I(M_1;Y_{11}^n|M_0,Z_{21}^n) + I(M_2;Z_{11}^n|Z_{21}^n,M_0) - I(M_1;M_2|M_0,Z_{21}^n) \\
&\le \lambda I(M_0,Y_{11}^n;Y_{21}^n) + (1-\lambda) I(M_0,Y_{11}^n;Z_{21}^n) + I(M_1;Y_{21}^n|M_0,Y_{11}^n) + I(X_{21}^n;Z_{21}^n|M_1,M_0,Y_{11}^n) \quad (7) \\
&\quad + \lambda I(M_0,Z_{21}^n;Y_{11}^n) + (1-\lambda) I(M_0,Z_{21}^n;Z_{11}^n) + I(M_1;Y_{11}^n|M_0,Z_{21}^n) + I(M_2;Z_{11}^n|M_0,Z_{21}^n) - I(M_1;M_2|M_0,Z_{21}^n),
\end{align*}

where $\epsilon_{1n}$ is a function that converges to zero as $n \to \infty$.
Using the two inequalities stated below

\begin{align*}
I(M_1;Y_{11}^n|M_0,Z_{21}^n) + I(M_2;Z_{11}^n|M_0,Z_{21}^n) - I(M_1;M_2|M_0,Z_{21}^n) &\le I(M_1;Y_{11}^n|M_0,M_2,Z_{21}^n) + I(M_2;Z_{11}^n|M_0,Z_{21}^n) \\
I(M_1;Y_{11}^n|M_0,Z_{21}^n) + I(M_2;Z_{11}^n|M_0,Z_{21}^n) - I(M_1;M_2|M_0,Z_{21}^n) &\le I(M_1;Y_{11}^n|M_0,Z_{21}^n) + I(M_2;Z_{11}^n|M_0,M_1,Z_{21}^n)
\end{align*}

in (7), we obtain the following two constraints on the sum-rate.

\begin{align*}
& n(R_0 + R_1 + R_2) - n\epsilon_{1n} \\
&\le \lambda I(M_0,Y_{11}^n;Y_{21}^n) + (1-\lambda) I(M_0,Y_{11}^n;Z_{21}^n) + I(M_1;Y_{21}^n|M_0,Y_{11}^n) + I(X_{21}^n;Z_{21}^n|M_1,M_0,Y_{11}^n) \quad (8) \\
&\quad + \lambda I(M_0,Z_{21}^n;Y_{11}^n) + (1-\lambda) I(M_0,Z_{21}^n;Z_{11}^n) + I(X_{11}^n;Y_{11}^n|M_0,M_2,Z_{21}^n) + I(M_2;Z_{11}^n|M_0,Z_{21}^n)
\end{align*}

\begin{align*}
& n(R_0 + R_1 + R_2) - n\epsilon_{1n} \\
&\le \lambda I(M_0,Y_{11}^n;Y_{21}^n) + (1-\lambda) I(M_0,Y_{11}^n;Z_{21}^n) + I(M_1;Y_{21}^n|M_0,Y_{11}^n) + I(X_{21}^n;Z_{21}^n|M_1,M_0,Y_{11}^n) \quad (9) \\
&\quad + \lambda I(M_0,Z_{21}^n;Y_{11}^n) + (1-\lambda) I(M_0,Z_{21}^n;Z_{11}^n) + I(M_1;Y_{11}^n|M_0,Z_{21}^n) + I(X_{11}^n;Z_{11}^n|M_0,M_1,Z_{21}^n)
\end{align*}
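The two inequalities used to reach the sum-rate constraints (8) and (9) are instances of a general fact: for any random variables, $I(A;Y|B,C) + I(A;B|C) - I(A;Y|C) = I(A;B|C,Y) \ge 0$. The sketch below checks this numerically on random joint distributions, with $A$, $B$, $C$, $Y$, $Z$ playing the roles of $M_1$, $M_2$, $(M_0, Z_{21}^n)$, $Y_{11}^n$, $Z_{11}^n$. The alphabet sizes and helper names are assumptions made for the illustration.

```python
import itertools
import math
import random

A, B, C, Y, Z = range(5)  # coordinate positions inside each outcome tuple


def random_joint(sizes, rng):
    """A random joint pmf over the product alphabet with the given sizes."""
    outcomes = list(itertools.product(*[range(s) for s in sizes]))
    weights = [rng.random() for _ in outcomes]
    total = sum(weights)
    return {o: w / total for o, w in zip(outcomes, weights)}


def entropy(joint, coords):
    """Joint entropy in bits of the listed coordinates."""
    marginal = {}
    for outcome, p in joint.items():
        key = tuple(outcome[c] for c in coords)
        marginal[key] = marginal.get(key, 0.0) + p
    return -sum(p * math.log2(p) for p in marginal.values() if p > 0)


def cond_mi(joint, left, right, given):
    """I(left; right | given), each argument a tuple of coordinate positions."""
    return (entropy(joint, left + given) + entropy(joint, right + given)
            - entropy(joint, left + right + given) - entropy(joint, given))


def gaps(seed):
    """Slack in the two inequalities, plus the closed form of the first slack."""
    joint = random_joint((2, 2, 2, 3, 2), random.Random(seed))
    i_ab_c = cond_mi(joint, (A,), (B,), (C,))
    gap_y = cond_mi(joint, (A,), (Y,), (B, C)) + i_ab_c - cond_mi(joint, (A,), (Y,), (C,))
    gap_z = cond_mi(joint, (B,), (Z,), (A, C)) + i_ab_c - cond_mi(joint, (B,), (Z,), (C,))
    closed_form = cond_mi(joint, (A,), (B,), (C, Y))
    return gap_y, gap_z, closed_form
```

Both slacks are non-negative for every joint distribution, so no independence of the messages is needed for this particular step.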

Starting from (8), standard manipulations as in [El 79] or [NE07] yield

\begin{align*}
& n(R_0 + R_1 + R_2) - n\epsilon_{1n} \\
&\le \sum_{i=1}^{n} \Big[ \lambda I(M_0,Y_{11}^n,Y_{21}^{i-1},Z_{2,i+1}^n;Y_{2i}) + (1-\lambda) I(M_0,Y_{11}^n,Y_{21}^{i-1},Z_{2,i+1}^n;Z_{2i}) \\
&\quad + I(M_1;Y_{2i}|M_0,Y_{11}^n,Y_{21}^{i-1},Z_{2,i+1}^n) + I(X_{2i};Z_{2i}|M_1,M_0,Y_{11}^n,Y_{21}^{i-1},Z_{2,i+1}^n) \\
&\quad + \lambda I(M_0,Z_{21}^n,Y_{11}^{i-1},Z_{1,i+1}^n;Y_{1i}) + (1-\lambda) I(M_0,Z_{21}^n,Y_{11}^{i-1},Z_{1,i+1}^n;Z_{1i}) \\
&\quad + I(X_{1i};Y_{1i}|M_0,M_2,Z_{21}^n,Y_{11}^{i-1},Z_{1,i+1}^n) + I(M_2;Z_{1i}|M_0,Z_{21}^n,Y_{11}^{i-1},Z_{1,i+1}^n) \Big].
\end{align*}

This leads to the single letter bound, using our identification of the auxiliaries,

\begin{align*}
R_0 + R_1 + R_2 &\le \lambda I(W_2;Y_2) + (1-\lambda) I(W_2;Z_2) + I(U_2;Y_2|W_2) + I(X_2;Z_2|U_2,W_2) \\
&\quad + \lambda I(W_1;Y_1) + (1-\lambda) I(W_1;Z_1) + I(V_1;Z_1|W_1) + I(X_1;Y_1|V_1,W_1).
\end{align*}

Starting from (9), similar manipulations yield the single letter bound

\begin{align*}
R_0 + R_1 + R_2 &\le \lambda I(W_2;Y_2) + (1-\lambda) I(W_2;Z_2) + I(U_2;Y_2|W_2) + I(X_2;Z_2|U_2,W_2) \\
&\quad + \lambda I(W_1;Y_1) + (1-\lambda) I(W_1;Z_1) + I(U_1;Y_1|W_1) + I(X_1;Z_1|U_1,W_1).
\end{align*}
Thus we have the bound

\begin{align*}
R_0 + R_1 + R_2 &\le \lambda I(W_2;Y_2) + (1-\lambda) I(W_2;Z_2) + I(U_2;Y_2|W_2) + I(X_2;Z_2|U_2,W_2) \\
&\quad + \lambda I(W_1;Y_1) + (1-\lambda) I(W_1;Z_1) + \min\{I(U_1;Y_1|W_1) + I(X_1;Z_1|U_1,W_1), \\
&\qquad I(V_1;Z_1|W_1) + I(X_1;Y_1|V_1,W_1)\}.
\end{align*}

Since the auxiliaries do not depend on $\lambda$ we can evaluate the bounds at $\lambda = 0$, $\lambda = 1$ to yield

\begin{align*}
R_0 + R_1 + R_2 &\le \min\{I(W_1;Y_1) + I(W_2;Y_2),\ I(W_1;Z_1) + I(W_2;Z_2)\} + I(U_2;Y_2|W_2) + I(X_2;Z_2|U_2,W_2) \\
&\quad + \min\{I(U_1;Y_1|W_1) + I(X_1;Z_1|U_1,W_1),\ I(V_1;Z_1|W_1) + I(X_1;Y_1|V_1,W_1)\}.
\end{align*}

Interchanging the roles Y{{ o , Y£\ <-> Z\\ , M 2 f> Mi,A « 1-A in @ and following a 
similar procedure we obtain 

\begin{align*}
R_0 + R_1 + R_2 &\le \min\{I(W_1;Y_1) + I(W_2;Y_2),\ I(W_1;Z_1) + I(W_2;Z_2)\} \\
&\quad + \min\{I(U_2;Y_2|W_2) + I(X_2;Z_2|U_2,W_2), \\
&\qquad I(V_2;Z_2|W_2) + I(X_2;Y_2|V_2,W_2)\} \\
&\quad + I(V_1;Z_1|W_1) + I(X_1;Y_1|V_1,W_1).
\end{align*}



Remark 8. An observant reader may wonder whether we forgot another set of manipulations that
would help us combine the two sum-rate bounds into the following term

\begin{align*}
R_0 + R_1 + R_2 &\le \min\{I(W_1;Y_1) + I(W_2;Y_2),\ I(W_1;Z_1) + I(W_2;Z_2)\} \\
&\quad + \min\{I(U_2;Y_2|W_2) + I(X_2;Z_2|U_2,W_2),\ I(V_2;Z_2|W_2) + I(X_2;Y_2|V_2,W_2)\} \\
&\quad + \min\{I(V_1;Z_1|W_1) + I(X_1;Y_1|V_1,W_1),\ I(U_1;Y_1|W_1) + I(X_1;Z_1|U_1,W_1)\}.
\end{align*}

However this is not the case. Not only does our choice of auxiliaries not yield this, this term is
not even an outer bound (for the product of reversely more capable channels). We leave the details
to the reader.

It remains to verify the following inequalities

\begin{align*}
R_0 &\le I(W_1;Y_1) + I(W_2;Y_2), \\
R_0 &\le I(W_1;Z_1) + I(W_2;Z_2), \\
R_0 + R_1 &\le I(W_1;Y_1) + I(W_2;Y_2) + I(U_1;Y_1|W_1) + I(U_2;Y_2|W_2), \\
R_0 + R_1 &\le I(W_1;Z_1) + I(W_2;Z_2) + I(U_1;Y_1|W_1) + I(U_2;Y_2|W_2), \\
R_0 + R_2 &\le I(W_1;Y_1) + I(W_2;Y_2) + I(V_1;Z_1|W_1) + I(V_2;Z_2|W_2), \\
R_0 + R_2 &\le I(W_1;Z_1) + I(W_2;Z_2) + I(V_1;Z_1|W_1) + I(V_2;Z_2|W_2).
\end{align*}

Of these, the first, second, third, and sixth follow directly from standard manipulations
for our choice of auxiliaries. Hence we show only the fourth inequality. The fifth follows in a similar
fashion as the fourth.
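To get the fourth inequality, from Fano's inequality we have

\begin{align*}
(R_0 + R_1) - \epsilon_{2n} &\le \frac{1}{n}\big(I(M_0;Z_{11}^n,Z_{21}^n) + I(M_1;Y_{11}^n,Y_{21}^n|M_0)\big) \\
&\le \frac{1}{n}\big(I(M_0,Y_{11}^n;Z_{21}^n) + I(M_0,Z_{21}^n;Z_{11}^n) + I(M_1;Y_{11}^n|M_0,Z_{21}^n) + I(M_1;Y_{21}^n|M_0,Y_{11}^n)\big) \\
&\overset{(a)}{\le} \frac{1}{n}\sum_{i=1}^{n} \Big[ I(M_0,Z_{21}^n,Y_{11}^{i-1},Z_{1,i+1}^n;Z_{1i}) + I(M_0,Y_{11}^n,Y_{21}^{i-1},Z_{2,i+1}^n;Z_{2i}) \\
&\qquad + I(M_1;Y_{1i}|M_0,Z_{21}^n,Y_{11}^{i-1},Z_{1,i+1}^n) + I(M_1;Y_{2i}|M_0,Y_{11}^n,Z_{2,i+1}^n,Y_{21}^{i-1}) \Big] \\
&\le I(W_1;Z_1) + I(W_2;Z_2) + I(U_1;Y_1|W_1) + I(U_2;Y_2|W_2),
\end{align*}

as desired. Here (a) again follows from standard manipulations. This completes the proof of the
outer bound for product channels. □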
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